Skip to main content

Data Analysis for Social Scientists

Learn methods for harnessing and analyzing data to answer questions of cultural, social, economic, and policy interest.

Data Analysis for Social Scientists

Learn methods for harnessing and analyzing data to answer questions of cultural, social, economic, and policy interest.

In this course, we will introduce you to the essential notions of probability and statistics. You will learn techniques in modern data analysis with applications drawn from real world examples and frontier research. You will also receive instruction for how to use the statistical package R with opportunities to perform self-directed empirical analyses.

Show More

This course is designed for anyone who wants to learn how to work with data and communicate data-driven findings effectively.

The course is free to audit. Learners can take a proctored exam and earn a course certificate by paying a fee, which varies by ability to pay. Please see our FAQ articles for more information on the certificate and audit track features as well as more information on the pricing structure. Enroll in this course by selecting the "enroll now" button at the top of the page.

This course can be completed by itself or as part of the MITx MicroMasters program in Data, Economics, and Design of Policy (DEDP), which provides a path toward the master’s in DEDP at MIT.

What you'll learn

The course will investigate the following topics:

  • Data analysis in R
  • Fundamentals of probability, random variables, and joint distributions
  • Collecting and describing data
  • Joint and conditional distributions of random variables
  • Joint, marginal, and conditional distributions, and functions of random variables
  • Special distributions, the sample mean, the central limit theorem, and estimation
  • Assessing and deriving estimators, confidence intervals, and hypothesis testing
  • Causality, analyzing randomized experiments, and nonparametric regression
  • Single and multivariate linear models
  • Practical issues in running regressions and omitted variable bias
  • Endogeneity, instrumental variables, and experimental design
  • Machine learning and data visualization

Access the full syllabus here.

Prerequisites

No prior preparation in probability and statistics is required, but familiarity with algebra and calculus is assumed.

Course Readiness Check:

Our course readiness checks help you determine if you should review key concepts before starting the course.

Please use this link to access the course readiness check and answer key.

Meet your instructors

  • Featured image for Esther Duflo
    Abdul Latif Jameel Professor of Poverty Alleviation and Development Economics in the Department of Economics
  • Featured image for Sara Fisher Ellison
    Senior Lecturer

Who can take this course?

Because of U.S. Office of Foreign Assets Control (OFAC) restrictions and other U.S. federal regulations, learners residing in one or more of the following countries or regions will not be able to register for this course: Iran, Cuba, Syria, North Korea and the Crimea, Donetsk People's Republic and Luhansk People's Republic regions of Ukraine.